Abstract: Gossip and, in particular, network coded algebraic
gossip have recently attracted attention as a fast,
bandwidth-efficient, reliable and distributed way to broadcast or
multicast multiple messages. While the algorithms are simple,
involved queuing approaches are used to study their performance.
The most recent result in this direction shows that uniform
algebraic gossip disseminates $k$ messages in
$O(\Delta(D + k + \log n))$ rounds, where $D$ is the diameter, $n$
the size of the network and $\Delta$ the maximum degree.

In this paper we give a simpler, short and self-contained
proof for this worst-case guarantee. Our approach also allows us
to reduce the quadratic $\Delta D$ term to $\min\{3n, \Delta D\}$.
We furthermore show that a simple round robin routing scheme also
achieves $\min\{3n, \Delta D\} + \Delta k$ rounds, eliminating both
randomization and coding. Lastly, we combine a recent non-uniform
gossip algorithm with a simple routing scheme to get an
$O(D + k + \log^{O(1)} n)$ gossip information dissemination
algorithm. This is order optimal as long as $D$ and $k$ are not
both polylogarithmically small.



I. Introduction 

Broadcast and multicast are fundamental communication 
primitives with many practical applications like the mainte- 
nance of distributed databases or content distribution networks. 
With modern networks increasing in size they often become 
less reliable and behave more like distributed systems. This 
increases the need for reliable ways to disseminate information 
without knowing the network topology. 

Gossip protocols in which nodes forward information to 
randomly chosen neighbors have been developed as a powerful 
new paradigm in this direction. Their randomized approach has 
been shown to disperse information quickly in many networks 
while keeping the total number of messages and the congestion 
on any particular link small. 

When more information than fits in a single packet is to be
distributed, selecting the right piece of information to forward
can be hard. Algebraic gossip, an adaptation of random linear
network coding, has been proposed as a solution, and it was
shown that in a complete network it outperforms routing [1].
Subsequent works like [2], [3], [4] extend this idea to general
networks. These works also follow the approach of [1] to
understand the performance of algebraic gossip by analyzing
the queuing behavior of innovative packets. The most recent
works in this line [3], [4] use Jackson's theorem to study
the worst-case performance of algebraic gossip in terms of
network diameter and maximum degree. This will also be the
main focus of this paper.

We show that the projection analysis technique of [5]
provides easier means to understand algebraic gossip. With
Theorem 1 we give a simple, short and self-contained proof
for the main result in [4], which itself generalizes the main
result in [3]. In Theorem 2 we then further tighten Theorem 1.
Next, we observe that the worst-case performance of algebraic
gossip can also be matched by uniform gossip without coding
(Theorem 4) or even a simple deterministic round robin routing
protocol (Theorem 3). Appealing to the optimality of algebraic
gossip [6] then leads to yet another alternative proof for
Theorems 1 and 2. Lastly, we follow the idea of [4] to bridge
the gap to more structured broadcast protocols by giving an
efficient non-uniform gossip protocol. Our Theorem 5 shows
that this protocol achieves an optimal performance in any
network up to an additive polylogarithmic term. All previous
algorithms had gaps of up to $O(n / \log n)$.

II. Gossip Algorithms and the Multicast Problem 

As in [4] we assume a network to be modeled by an
undirected graph $G = (V, E)$ with $n = |V|$ nodes, node
degree $\Delta_u$ for $u \in V$, maximum degree
$\Delta = \max_u \Delta_u$ and diameter $D$. During a gossip
protocol nodes communicate in synchronous rounds in which each
node can initiate one bidirectional exchange of a packet with one
neighbor. The goal of the information dissemination, rumor
spreading or $k$-message multicast task is to spread $k$ initially
distributed messages to all nodes as fast as possible and with
high probability (whp), that is, with probability at least
$1 - 1/n$. We assume that the messages have equal size and that a
packet can fit one message plus some header information
(which becomes negligibly small for large message/packet
sizes). A gossip solution to the multicast problem specifies
both which node contacts which in each round and what is
sent in each transmitted packet. The most studied instantiations
for these choices are uniform gossip, that is, each node contacts
a uniformly random neighbor in each round, and algebraic
gossip, that is, nodes perform random linear network coding
on the messages to create new packets. In random linear
network coding nodes send out random linear combinations
of messages (over some finite field $GF(q)$) together with a
vector containing all coefficients used. Each new packet is
then simply created as a uniformly random linear combination
of already received packets or initially known messages; to
decode, nodes perform Gaussian elimination.
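
The coding and decoding steps described above can be illustrated with a minimal sketch over $GF(2)$. This is not taken from the paper: the representation of a packet as a pair (coefficient bitmask, integer payload) and the function names `random_combination`, `decode` and `receive_until_decodable` are assumptions made for illustration only.

```python
import random

def random_combination(packets, rng):
    """Uniformly random GF(2) combination: XOR a random subset of packets.

    Each packet is (coeffs, payload); bit i of coeffs is the coefficient
    of message i. This packet layout is an illustrative assumption.
    """
    coeffs, payload = 0, 0
    for c, p in packets:
        if rng.random() < 0.5:
            coeffs ^= c
            payload ^= p
    return coeffs, payload

def decode(packets, k):
    """Gaussian elimination over GF(2).

    Returns the k messages if the coefficient vectors have rank k,
    otherwise None.
    """
    pivots = {}  # pivot bit -> fully reduced row (coeffs, payload)
    for c, p in packets:
        # Remove all existing pivot bits from the incoming row.
        for bit, (pc, pp) in pivots.items():
            if (c >> bit) & 1:
                c ^= pc
                p ^= pp
        if c == 0:
            continue  # not innovative: linearly dependent on earlier rows
        bit = c.bit_length() - 1
        # Clear the new pivot bit from the rows stored so far.
        for b, (pc, pp) in list(pivots.items()):
            if (pc >> bit) & 1:
                pivots[b] = (pc ^ c, pp ^ p)
        pivots[bit] = (c, p)
    if len(pivots) < k:
        return None
    return [pivots[i][1] for i in range(k)]

def receive_until_decodable(messages, rng):
    """A receiver collects random combinations from a node knowing everything."""
    source = [(1 << i, m) for i, m in enumerate(messages)]
    received = []
    while True:
        received.append(random_combination(source, rng))
        decoded = decode(received, len(messages))
        if decoded is not None:
            return decoded, len(received)
```

For example, `receive_until_decodable([7, 42, 99], random.Random(0))` returns the original messages together with the number of coded packets that were needed; that number varies with the random choices.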

III. A Simpler and Stronger Proof for Uniform 
Algebraic Gossip 

The main result in [4] is the following worst-case guarantee
on the performance of algebraic gossip in any network:

Theorem 1. Uniform algebraic gossip over $GF(2)$ disseminates
$k$ messages whp in no more than $O(\Delta(D + k + \log n))$
time.

The proof in [4] is somewhat involved and uses Jackson's queuing
theorem. Here, we show that one can alternatively obtain the
same result using the projection analysis of [5]. In particular,
one can essentially get it by setting the min-cut probability
$\gamma$ in [5, Lemma 6] to be $1/\Delta$. Next, we give a simple,
short and completely self-contained proof based on [5, Theorem 3]:

Proof: We say a node knows a coefficient vector
$\mu \in \{0, 1\}^k$ if it has received a packet with a coefficient
vector that is non-perpendicular to $\mu$ (over $GF(2)$). We claim
that for any non-zero vector $\mu$ the probability that any fixed
node $v$ does not learn $\mu$ within $O(\Delta(D + k + \log n))$
rounds is at most $2^{-(k + 2 \log n)}$. Then, a union bound over
all nodes and all $2^k$ coefficient vectors shows that whp all
nodes know all vectors. From this it is easy to conclude that all
nodes can decode.
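
The union bound can be written out as follows, assuming (this is not stated explicitly in the text) that logarithms are taken to base 2. The decoding conclusion uses a standard linear-algebra fact: if the coefficient vectors received by a node spanned only a proper subspace of $GF(2)^k$, some non-zero $\mu$ would be perpendicular to all of them, so a node that knows every non-zero $\mu$ holds vectors of full rank $k$.

```latex
\begin{align*}
P[\text{some node fails to learn some } \mu \neq 0]
  &\le n \cdot 2^{k} \cdot 2^{-(k + 2\log n)} \\
  &= n \cdot 2^{-2\log n}
   = n \cdot n^{-2}
   = \frac{1}{n}.
\end{align*}
```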

To prove this claim we look at a shortest path $P$ from $v$
to a node that initially knows $\mu$ (that is, that starts with a
message with a non-zero coefficient in $\mu$). At any round $t$ let
node $u$ be the closest node to $v$ on $P$ that knows $\mu$. There
is a $1/\Delta_u$ chance that $u$ contacts the next node on the
path and independently a chance of $1/2$ that $u$ sends out a
packet with a coefficient vector that is non-perpendicular to
$\mu$. Thus, in any round independently with probability at least
$\frac{1}{2\Delta}$ knowledge of $\mu$ makes progress on $P$. A
Chernoff bound shows that the probability that less than $D$
progress is made in $16\Delta(D + k + \log n)$ rounds when
$8(D + k + \log n)$ successes are expected is at most
$2^{-(k + 2 \log n)}$ as claimed. ■
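
One way to carry out the Chernoff step is sketched below. It rests on assumptions not fixed by the text: logarithms are base 2, the multiplicative lower-tail bound $P[S \le (1 - \delta) E[S]] \le e^{-\delta^2 E[S] / 2}$ is used, and $S$ and $N$ are names introduced here. $S$ denotes the number of successes of a binomial variable with $N$ trials and success probability $\frac{1}{2\Delta}$, which lower-bounds the progress made on $P$.

```latex
\begin{align*}
N &= 16\Delta(D + k + \log n), \qquad
E[S] = \frac{N}{2\Delta} = 8(D + k + \log n), \\
P[S < D]
  &\le P\!\left[S \le \tfrac{1}{8}\,E[S]\right]
   \le \exp\!\left(-\tfrac{1}{2}\left(\tfrac{7}{8}\right)^{2} E[S]\right)
   = \exp\!\left(-\tfrac{49}{16}\,(D + k + \log n)\right) \\
  &\le e^{-3(k + \log n)}
   \le 2^{-(k + 2\log n)}.
\end{align*}
```

The first inequality uses $D \le D + k + \log n = E[S]/8$, and the last one uses $e^{-3} < 2^{-2}$.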

Note that even for constant $k$ Theorem 1 results in a
quadratic bound of $\Delta D = \Theta(n^2)$ for many networks.
Next, we tighten the bound of Theorem 1 and show that a small
number of messages (e.g., $k = O(\log n)$) never takes more than
$O(n \log n)$ time:

Theorem 2. Uniform algebraic gossip disseminates $k$ messages
whp in at most $O(\min\{n, \Delta D\} + \Delta(k + \log n))$ time.

Proof: In the proof of Theorem 1 let $X_1, X_2, \ldots$ be the
number of rounds spent for knowledge about $\mu$ to successfully
make the $i$th step along $P$ towards $v$ and let
$X = \sum_i X_i$ be the total number of rounds needed. Note that
each $X_i$ is an independent geometric random variable with
success probability $\frac{1}{2\Delta_u}$, where $u$ is the node
on $P$ from which the $i$th step is taken, and thus
$E[X] = \sum_i E[X_i] = \sum_{u \in P} 2\Delta_u \le 2D\Delta$.
Since $P$ is a shortest path, the sum of node degrees
along $P$ is at most $3n$ because every node can be adjacent
to at most three consecutive nodes on $P$ without creating
a shortcut. We thus also have $E[X] \le 6n$. By applying
Markov's inequality to $(1 - 1/\Delta)^{-2X}$ one can show that
$P[X > 2(E[X] + t)] < (1 - 1/\Delta)^{t}$. Using this with
$t = \Delta(k + \log n)$ replaces the Chernoff bound argument in
the second paragraph of the proof of Theorem 1. ■
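
The degree-sum argument (a shortest path has total degree at most $3n$) can be checked numerically. The sketch below is illustrative only: the adjacency-dictionary representation, the helper names and the "lollipop" test graph (a clique with a path attached) are assumptions, not part of the paper.

```python
from collections import deque

def shortest_path(adj, src, dst):
    """BFS shortest path in a connected undirected graph (adjacency dict)."""
    parent = {src: None}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        if u == dst:
            break
        for w in adj[u]:
            if w not in parent:
                parent[w] = u
                queue.append(w)
    # Walk parent pointers back from dst to src.
    path = [dst]
    while parent[path[-1]] is not None:
        path.append(parent[path[-1]])
    return path[::-1]

def degree_sum(adj, path):
    """Sum of node degrees along a path."""
    return sum(len(adj[u]) for u in path)

def lollipop(clique, tail):
    """Clique on nodes 0..clique-1 with a path of `tail` extra nodes attached."""
    n = clique + tail
    adj = {u: set() for u in range(n)}

    def add_edge(u, w):
        adj[u].add(w)
        adj[w].add(u)

    for u in range(clique):
        for w in range(u + 1, clique):
            add_edge(u, w)
    for u in range(clique - 1, n - 1):
        add_edge(u, u + 1)
    return adj

if __name__ == "__main__":
    adj = lollipop(6, 4)
    n = len(adj)
    worst = max(degree_sum(adj, shortest_path(adj, s, t))
                for s in adj for t in adj)
    print(f"n = {n}, largest degree sum on a BFS shortest path = {worst}, 3n = {3 * n}")
```

Each node of the graph contributes at most three to the sum, once for each of the at most three path nodes it can be adjacent to, which is why the total never exceeds $3n$.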

IV. Gossip without Coding or Randomization 

Among the main motivations for using randomized gossip
is that its randomization adds robustness and often avoids
bottlenecks in the network. In regular, well-connected
networks this allows for much faster dissemination times than
the bound given in Theorem 2. Adding coding on top of this
(in particular, network coded algebraic gossip) allows mixing
and efficient diffusion of information which often significantly
increases throughput [5].

In this section, we show that if one just wants to achieve the
worst-case running time of Theorem 2, neither randomization
nor coding is needed. Instead, the following simple deterministic
round robin routing achieves the same performance (with
no probability of failure): Each node $u$ repeatedly contacts
its neighbors such that every neighbor is contacted at least
once every (order) $\Delta_u$ rounds. A node furthermore can
forward any packet as long as the same packet is not forwarded
twice to the same node. One simple protocol that does this
efficiently is prioritized round robin routing. In this protocol
each node fixes a cyclic order of its neighbors and simply contacts
the next node in its list in every round. Each node $u$ furthermore
keeps a counter of how often it has sent out each packet and
sends out the lexicographically first packet it has not yet sent
out $\Delta_u$ times (if such a packet exists).

Theorem 3. (Prioritized) round robin routing disseminates all
$k$ messages in at most $\min\{3n, \Delta D\} + \Delta k$ time.
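
A small simulation can make the protocol concrete. The sketch below is a simplified reading of prioritized round robin routing and not the author's implementation: it models only the packet sent by the initiating node (the reply half of the bidirectional exchange is omitted), messages are integers ordered by value, rounds are numbered from 1, and the names `round_robin_multicast`, `adj` and `initial` are hypothetical.

```python
def round_robin_multicast(adj, initial, max_rounds=10**6):
    """Simulate push-only prioritized round robin routing.

    adj:     node -> list of neighbors; the list order is the fixed cyclic order
    initial: node -> iterable of message ids initially known at that node
    Returns the number of synchronous rounds until every node knows
    every message (the graph is assumed to be connected).
    """
    known = {u: set(initial.get(u, ())) for u in adj}
    sent = {u: {} for u in adj}  # sent[u][m] = how often u has sent m
    all_msgs = set().union(*known.values())
    rounds = 0
    while any(known[u] != all_msgs for u in adj):
        rounds += 1
        if rounds > max_rounds:
            raise RuntimeError("no progress; is the graph connected?")
        deliveries = []
        for u, nbrs in adj.items():
            # Contact the next neighbor in the cyclic order.
            target = nbrs[(rounds - 1) % len(nbrs)]
            # Smallest message not yet sent deg(u) times, if any.
            pending = [m for m in known[u] if sent[u].get(m, 0) < len(nbrs)]
            if pending:
                m = min(pending)
                sent[u][m] = sent[u].get(m, 0) + 1
                deliveries.append((target, m))
        # Apply all deliveries at the end of the round (synchronous model).
        for target, m in deliveries:
            known[target].add(m)
    return rounds

if __name__ == "__main__":
    # Path 0-1-2-3 with both messages starting at node 0.
    path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
    n, diameter, max_deg, k = 4, 3, 2, 2
    rounds = round_robin_multicast(path, {0: {0, 1}})
    bound = min(3 * n, max_deg * diameter) + max_deg * k
    print(f"finished after {rounds} rounds; Theorem 3 bound is {bound}")
```

Because a node sends in every round in which it has a pending message, each message is sent $\Delta_u$ times in total and reaches every neighbor of $u$ exactly once, which is the property the proof of Theorem 3 relies on.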

Proof: Throughout the proof we use the following notion
of distance between any two nodes $u$ and $v$: Let
$\mathrm{dist}(u, v)$ be the smallest number such that there is a
path $P$ between $u$ and $v$ with
$\sum_{w \in P \setminus \{v\}} \Delta_w = \mathrm{dist}(u, v)$.
Note that $\mathrm{dist}(u, v) \le \min\{3n, \Delta D\}$ as shown
in the proof of Theorem 2. We claim that for any nodes $u, v$, any
message $m$ and any $i \ge 0$, if node $u$ knows message $m$
initially then at time $t = \mathrm{dist}(u, v) + \Delta i$
node $v$ will either know message $m$ or at least $i + 1$ messages
smaller than $m$. Note that this claim also implies the theorem
since at time $\min\{3n, \Delta D\} + \Delta k$ any node has either
received every message $m$ or $k$ (smaller) messages.

We prove the claim by induction on $t$. Nothing needs to be
shown for $t = 0$. For the induction step consider any $u, v, m$
and $t = \mathrm{dist}(u, v) + \Delta i$ and let $v'$ be the
neighbor of $v$ on a shortest distance path from $u$ to $v$. By
induction hypothesis we get that for any $j \le i$ node $v'$ at
time $\mathrm{dist}(u, v') + \Delta j$ either knows $m$ or $j + 1$
messages smaller than $m$. From this it is clear that at time
$\mathrm{dist}(u, v') + \Delta_{v'} = \mathrm{dist}(u, v)$ node
$v'$ has sent either message $m$ or one message smaller than $m$
to $v$ and similarly for any $j \le i$ at time
$\mathrm{dist}(u, v') + \Delta j + \Delta_{v'} = \mathrm{dist}(u, v) + \Delta j$
node $v'$ has sent either $m$ or $j + 1$ messages to $v$ that are
smaller than $m$. For $j = i$ this is exactly the claim that needed
to be proved. ■

We remark that a similar performance guarantee also holds
for randomized uniform gossip. It can be obtained along the
same lines as the proof for Theorem 3. We state the result next
but omit the proof.

Theorem 4. Prioritized uniform gossip whp routes all messages
to all nodes in $O(\min\{n, \Delta D\} + \Delta(k + \log n))$ time.

Given that these two simple routing schemes achieve a
good worst-case performance one would expect that adding
algebraic gossip on top of them should not harm the performance.
Indeed, we can make this formal and appeal to the optimality of
algebraic gossip [6] to give yet another simple and alternative
proof for Theorem 2.

Proof of Theorem 2: In contrast to before we require
the field size used for coding to be at least $q = n^2$, which
gives rise to $2 \log n$-size coefficients. In [6] it is shown
that with probability $1 - n/q$ algebraic gossip completes in
exactly optimal time, that is, at the first time it is possible
in hindsight to route all messages to each node individually.
Furthermore, Theorem 4 shows how to route the messages whp
in $O(\min\{3n, \Delta D\} + \Delta(k + \log n))$ rounds via the
uniform gossip exchanges. Thus with probability
$1 - n/q - 1/n = 1 - 2/n$ uniform algebraic gossip also completes
in this time.



V. Faster Non-Uniform Gossip

Lastly, [4] shows that fast non-uniform gossip protocols can
be obtained by first running a 1-message broadcast gossip
algorithm and then performing algebraic gossip along an
induced spanning tree. More specifically, for any broadcast
algorithm with running time $B$ (and given a leader) this leads
to an $O(B + k + \log n)$ solution to the $k$-message multicast
problem considered here. Instantiating this with the broadcast
protocol from [7] it is shown that an $O(D + k + \log n)$
algebraic gossip protocol is possible for graphs with weak
conductance $\Phi$. Unfortunately, there are graphs for which
$\Phi^{-1} \log n = \Theta(n)$ even though the network diameter is
logarithmic (for example, a balanced binary tree). For these
graphs a gap of up to $O(n / \log n)$ between the upper bound and
the $\Omega(D + k)$ lower bound remains. The following theorem
improves upon this using [8], a recent strengthening of [7]:

Theorem 5. There is a gossip algorithm that whp routes $k$
messages to all nodes in $O(D + k + \log^{O(1)} n)$ time in any
network.

Proof: In [8] a broadcast gossip algorithm with time
$O(D + \log^{O(1)} n)$ is given. We first run this algorithm with
each node using its ID as a message. After completion every
node declares the node it first learned the smallest ID from as
its parent (breaking ties arbitrarily). This induces a spanning
tree. In all future rounds, each node initiates a bidirectional
exchange with its parent, forwarding any message not sent by it
before (if such a message exists). A standard pipelining proof
(similar to Theorem 3) shows that after $O(k + D')$ time all
nodes know about all messages, where $D'$ is the diameter of
the tree. Since a message can travel at most one step per round
during the initial broadcast we have $D' = O(D + \log^{O(1)} n)$,
which completes the proof. ■

VI. Conclusion

We have given simple, short and tighter proofs for the
worst-case performance of algebraic gossip using the projection
analysis technique of [5]. This improves over the recent
results in [4]. We could furthermore show that this worst-case
performance is also met by a simple round robin routing
scheme that eliminates both randomness and coding. A similar
routing scheme has been studied intensively under the term
quasirandom rumor spreading [9] and has been shown to
perform surprisingly well in many topologies. Lastly, bridging
the gap to more structured broadcast protocols we have shown
a non-uniform gossip protocol that achieves order optimal (up
to an additive polylogarithmic term) distributed information
spreading. None of these results takes unreliability
or changes in the network topology into account. This is
unfortunate since fault-tolerance with respect to dynamic or
unreliable topologies is one of the main motivations behind
pursuing randomized gossip approaches. Some work in this
direction can be found in [5], but formalizing and studying
dynamic topologies and the reliability of the algorithms
presented here remains an important open problem of interest.


